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Abstract 

We deal with the equivariant estimation of scatter and location for 
p-dimensional data, giving emphasis to scatter. It is important that the 
estimators possess both a high efficiency for normal data and a high re¬ 
sistance to outliers, that is, a low bias under contamination. The most 
frequently employed estimators are not quite satisfactory in this respect. 
The Minimum Volume Ellipsoid (MVE) and Minimum Covariance De¬ 
terminant (MCD) estimators are known to have a very low efficiency. 
S-Estimators with a monotonic weight function like the bisquare have a 
low efficiency for small p, and their efficiency tends to one with increasing 
p. Unfortunately, this advantage is paid for by a serious loss of robustness 
for large p. 

We consider four families of estimators with controllable efficiencies 
whose performance for moderate to large p has not been explored to date: 
S-estimators with a non-monotonic weight function (Rocke 1996), MM- 
estimators, r-estimators, and the Stahel-Donoho estimator. Two types of 
starting estimators are employed: the MVE computed through subsam¬ 
pling, and a semi-deterministic procedure proposed by Pena and Prieto 
(2007) for outlier detection. 

A simulation study shows that the Rocke estimator starting from the 
Pena-Prieto estimator and with an adequate tuning, can simultaneously 
attain high efficiency and high robustness for p > 15, and the MM esti¬ 
mator can be recommended for p<15. 

Keywords: MM-estimator, tau-estimator, S-estimator, Stahel-Donoho 
estimator, Kullback-Leibler divergence. 


1 Introduction 

Consider a sample X ={xi, ...,x„} C R p . We look for substitutes /I £ R p and 
S £ R pxp of the sample mean vector and covariance matrix, that are resistant 
to atypical observations. We also want estimators that have a high efficiency for 
normal samples. As a measure of robustness we consider not only the breakdown 
point but also the maximum expected Kullback-Leibler divergence between the 
estimator and the true value, under contamination. 
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The most frequently employed estimators are not quite satisfactory in this 
respect. The Minimum Volume Ellipsoid (MVE) and the Minimum Covariance 
Determinant (MCD) estimators (Rousseeuw 1985) are known to have a low 
efficiency. This efficiency can be increased by means of a “one-step reweighting”. 
Croux and Haesbroeck (1999, Tables VII and VIII) computed the finite-sample 
efficiencies of the reweighted MCD; although they are much higher than for 
the “raw” estimator, they are still low if one wants a high breakdown point. 
S-Estimators (Davies 1987) with a monotonic weight function like the bisquare 
have a low efficiency for small p. Rocke (1996) showed that their efficiency tends 
to one with increasing p; unfortunately, this advantage is paid for with a serious 
loss of robustness for large p. 

We restrict ourselves to equivariant estimators. There exist many non- 
equivariant proposals; but the comparison between equivariant and non-equivariant 
estimators is difficult. In particular, a non-equivariant estimator is more difficult 
to tune for a given efficiency, since the latter depends on the correlations. 

Among the published equivariant proposals, there are four families of esti¬ 
mators with controllable efficiencies: non-monotonic S-estimators (Rocke 1996), 
MM-estimators (Tatsuoka and Tyler 2000), r-estimators (Lopuhaa 1991) and 
the estimator proposed independently by Stahel (1981) and Donoho (1982) but 
their behavior for large dimensions has not been explored to date. We compare 
their behaviors employing different weight functions. A simulation study shows 
that the Rocke and MM estimators, with an adequate weight function and an 
adequate tuning, can simultaneously attain high efficiency and high robustness. 

It will be seen below that if we have a good E, it is easy to find a good 
equivariant p, but the converse is not true. For this reason we shall put more 
emphasis on the estimation of the scatter matrix. 

Since all the considered estimators are based on the iterative minimiza¬ 
tion of a non-convex function, the starting values are crucial. Subsampling 
is the standard way to compute starting values; but we shall see that a semi- 
deterministic equivariant procedure proposed by Pena and Prieto (2007) may 
yield both shorter computing times and better statistical performances. 

In Section [2] we describe monotonic M-estimators; Section [3] deals with esti¬ 
mators based on the minimization of a robust scale of Mahalanobis distances. 
Sections [4] and [5] deal with MM and Stahel-Donoho estimators respectively. In 
Section [6] we discuss the choice of the p —function for MM- and r—estimators. 
Section [7] deals with computational details. In Section [8] the estimators are 
compared through a simulation study. In Section [9] the estimators are applied 
to a real data set. Finally Section [TO] summarizes the results. Section EH is 
an Appendix containing the full results of the simulations, the approximations 
for the tuning constants and some details on the Rocke and the Pena-Prieto 
procedures. 
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2 Monotonic M-estimators 


For x, /i G RP and S GR pxp define the (squared) Mahalanobis distance as 


d (x, p, £) = (x — p)' £ 1 (x-p). 


Let IT be a bounded nonincreasing “weight function”. 
M-estimators (Maronna 1976) are defined as solutions of 

Then monotonic 

1 " 

~y2 w ( di ) (x - p) (x - p)' = S 

i—1 

(1) 

1 n 

-J2 W k)( x -m) = ° 

n z ' 

i =1 

(2) 

where for brevity we put 

di = d (xi, /i, 5]). 



The uniqueness of the solutions requires that W (d) d be nondecreasing. 
Unfortunately, this implies (Maronna, 1976) that the breakdown point is < 
1/ (p + 1), which makes these estimators unreliable except for small p. Besides, 
this fact holds even if p is known, while the asymptotic breakdown point of p 
with known S is 0.5 with an adequate W. This shows that the main problem to 
attain high robustness is the scatter matrix. 

3 Estimators based on the minimization of a ro¬ 
bust scale 

For d = (di,..., d n ) let S (d) be a robust scale. Put 

d(tb E) = (d(x 1 ,p,H),...,d(x 1 ,p,'E)). 

A general family of estimators can be defined by 

(fts) = argmin S (d (p, £)), p Gi? p , £ eR p , |S| = 1, (3) 

where the condition |S| = 1 rules out trivial solutions with £ —> oo. 

If S (d) = Median (d) we have the “Minimum Volume Ellipsoid” (MVE) 
estimator, and if S' is a trimmed mean, we have the “Minimum Covariance 
Determinant” (MCD) estimator, both proposed by Rousseeuw (1985). The 
first one is very robust, but has a null asymptotic efficiency; the second is 
very popular, but its asymptotic efficiency is very low; see (Paindaveine and 
Van Bever, 2014) and references therein, and its maximum contamination bias 
increases rapidly with p (Agostinclli et al, 2015, Table 1). 

The condition |£| = 1 means that we estimate the “shape” of the scatter. 
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Given the shape, the ’’size” can easily estimated to yield consistency at the 
normal model (Maronna et ah, Section 6.3.2). A simple way is to put 

Median ("d f/'qll')') __ 

E=- V ) S. (4) 

Median (xf) 

Instead of the median, one could use more efficient scales, such as an M-scale, 
but exploratory simulations indicate that they do not yield better results. 


3.1 S-estimators 

Let S = S (di, ..,d n ) be a scale M-estimator defined as solution of 


1 

n 


Ep 



= 6, 


(5) 


where 5 £ (0,1) controls the breakdown point, and p(t) £ [0,1] is smooth and 
nondecreasing in t > 0, with p (0) = 0 and maxp = 1. Then S-estimators (Davies 
1987) are defined as solutions of d3j) with S given by (0. 

The maximum finite-sample replacement breakdown point is attained when 


<5 = 0.5(l-()), (6) 

and its value is equal to this S. See (Maronna et ah, 2006, Section 6.4.2). 

A popular p is the bisquare given by 


p{d) 


1 - (1 - df if d < 1 
1 if d > 1. 


(7) 


Note that the usual bisquare p employed for regression is actually pbis (t) = 
p (t 2 ) . However, since we are dealing with the squared distances, we employ in 

0 Pbis (Vd) = p [d). 

It is easy to show that S-estimators satisfy the “estimating equations” 

^,E w ("1) ( x - ^) ( x - =s ( § ) 

i—l ^ ' 

s !>(!)<*-ri = ° < 9 > 

1 = 1 V 7 

with W = p' . That is, they satisfy the equations 0-0 which define monotonic 
M-estimators, with weight function W = p' . Here, since p is bounded W {d) d is 
not a nondecreasing function, and therefore this case is different from monotonic 
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p 2 5 10 20 30 40 50 

Efficiency 0.427 0.793 0.930 0.976 0.984 0.990 0.992 


Table 1: Efficiencies of the S-estimator with bisquare weights for dimension p 

M-estimators. In particular, the breakdown point is not bounded by (1 + p)~ ; 
as shown by ©■ 

For the bisquare, the weight function is 

W{t) = 3 (1 — f ) 2 1 (f < 1 ) 

(where I(.) denotes the indicator), which is decreasing. It seems intuitive that 
the weights of the observations should decrease with their “outlyingness”. How¬ 
ever it will be seen in the next Section that monotonicity is not necessarily 
favorable. 

3.2 S-estimators with a non-monotonic weight function 

Rocke (1996) showed that if W is nonincreasing, the efficiency of the estimator 
tends to one when p —> oo. A similar result was derived by Kent and Tyler 
(1996, page 1363) for their constrained M-estimators. 

Table[T]shows the efficiencies (to be defined later) of the bisquare S-estimator 
of scatter for normal p-dimensional data. 


However, it will be seen that the price for this increase in efficiency is a 
decrease in robustness. More precisely, although the breakdown point does not 
tend to zero with increasing p , the bias caused by contamination grows rapidly 
with p. This fact suggests that we need estimators with a controllable efficiency. 
But while in regression the efficiency has to be controlled to make it higher, 
here we need to prevent it from becoming “too high”. 

Based on the fact that for large p the p -variate standard normal distribu¬ 
tion N p ( 0 , 1 ) is concentrated “near” the spherical shell with radius yjp , Rocke 
(1996) proposed estimators with non-monotonic weight functions. Maronna et 
al. (2006) proposed a modification of Rocke’s “biflat” function, namely 


W{d) 



1(1 


7 < d < I + 7 ) 


( 11 ) 


with 


7 = min 



Xp( 1 -qQ 

p 



( 12 ) 


where y 2 (/?) is the /3-quantile of the y 2 distribution with p degrees of freedom, 
and a is “small” to control the efficiency. 
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Maronna et al (2006, Sec. 6.8) dealt only with location. The performance 
of the respective scatter matrix will be studied below. 


3.3 r—estimators 

t— estimators were proposed by Yohai and Zamar (1988) to obtain robust regres¬ 
sion estimators with controllable efficiency, and later Lopuhaa (1991) employed 
the same approach for multivariate estimation. This approach requires two 
functions pi and p 2 ■ For given (p, X) call uq (lb E) the solution of 


1 

n 




d(xi,p, X) 


vo 


= S. 


Then the estimator minimizes the “r-scale” 


<7 


(/b E) = v 0 (p, E) - V p 2 

n L ' 


i= 1 


d {x-j, p, S) \ 
vo (m,E) ) 


Here 

P2 (t) = Pi (^j (13) 

where c is chosen to regulate the efficiency. 

Originally, r-estimators were proposed to obtain estimators with higher ef¬ 
ficiency than S-estimators for small p, which required c > 1; but for large p we 
need c < 1 in order to decrease the efficiency. 


4 MM-estimators 


MM-estimators were initially proposed by Yohai (1987) to obtain regression 
estimators with a controllable efficiency. This approach has been used in the 
multivariate setting by Lopuhaa (1992) and Tatsuoka and Tyler (2000). Here 
we give a simplified version of the latter. 

Let (jlo^o'j be an initial very robust although possibly inefficient estimator. 
Put ^ 

dj — d ^x.j,^0;Eg^ 

and call S the respective M-scale 


1 

n 



= 6. 


(14) 
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( 15 ) 


The estimator is defined 


by (jl, with | S | 


= 1 such that 



= min, 


where di = d (^Xy /x, Sj and the constant c is chosen to control efficiency. 
It can be shown that the solution satisfies the equations 


(“^) ( x “^) ( x ~ V)' = E ( 16 ) 

7 — 1 X X 


with W = p ', 

Actually, it is not necessary to obtain the absolute minimum in (1151) . As 
with regression MM-estimators (Yohai 1987) it is possible to show that any 
solution of (flbl) for with the objective function (fTSl) is lower that for the initial 
estimator, has the same asymptotic behavior as the absolute minimum and has 
a breakdown point at least as high as the initial estimator. 

Like r-estimators, MM estimators were originally proposed to obtain esti¬ 
mators with higher efficiency than S-estimators for small p; but here for large p 
the constant has to chosen to prevent the efficiency becoming too high. 


5 The Stahel-Donoho estimator 


Let M (.) and S (.) be univariate location and dispersion statistics, e.g., the 
median and MAD. Define for any y £ RP the outlyingness r: 


r(y) 


= max 

a 


|a'y-M(a'X)| 

S'(a'X) 


(17) 


where the supremunr is over a £ R p with a ^ 0 or equivalently over the spherical 
surface S p = {a £ R p :|| a ||= 1}. Here a'X denotes a'xi,..., a'x„. Let W 
(the weight function ) be a positive function. The Stahel Donoho estimator of 
location and scatter, (t(X), V(X)), is a weighted mean and covariance matrix, 
with weights Wi = W(r(xj)). 

If W is continuous, and W{r) and W(r)r 2 are bounded for r > 0, the 
estimators have asymptotic breakdown point 0.5 for all p at continuous multi¬ 
variate models, if M and S have asymptotic breakdown point 0.5 (see Hampel 
et al.1986). The finite-sample breakdown point was derived by Tyler (1994). 

Maronna and Yohai (1995) showed that these estimators have order y in¬ 
consistency. Their asymptotic distribution was given by Zuo et al. (2004). 

Maronna and Yohai (1995) recommended a “Huber-type” W; however, fur¬ 
ther exploratory simulations indicate that better results are obtained with the 
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weight function described in the next section. 

The numerical computation of these estimators is difficult. Stahel (1981) 
proposed an approximate algorithm based on subsampling, the cost of which 
increases rapidly with p. Pena and Prieto (2007) proposed a fast algorithm for 
outlier detection which combines the projections on a set of 2 p deterministic 
directions that are extrema of the kurtosis, and a set of random directions. 
Although this method was originally meant for data analysis, it offers two further 
uses. First, the resulting projections can be employed to compute the Stahel- 
Donoho estimator; second, the method yields a robust (but probably inefficient) 
estimator that can be used as a starting point for the iterative computing of the 
estimators described above. Further details about this procedure are given in 
Section [j] 


6 Choosing p for MM- and r—estimators 


The most popular p in robust methods seems to be the bisquare. Yohai and 
Zamar (1997) proposed a p for regression with certain optimality properties. A 
simplified variant of this function is given by Muler et al (2002). Its version for 
multivariate estimation has weight function 


I 1 if d < 4 

W opt (d) = < q{d) if 4 < d < 9 , (18) 

[ 0 if d > 9 

where 

q (d) = —1.944 + 1.728d - 0.312d 2 + 0.016d 3 

is such that W is continuous and differentiable at d = 4 and d = 9. The respec¬ 
tive p function is 


p(d) 


1 

6.494 


d if d < 4 

s(d) if 4 < d < 9 , 
6.494 if d > 9 


where 

s (d) = 3.534 - 1.944d + 0.864d 2 - 0.104d 3 + 0.004d 4 . 

Figure Q] shows the bisquare and “optimal” weight functions, scaled with 
their respective tuning constants for the MM-estimator with 90% efficiency and 
p = 30. It is seen that the “optimal” p yields a smaller cutoff point. 


7 Computing issues 

All estimators described above are computed as iterative reweighted means and 
covariances, starting from an initial estimator. For S-, r— and MM estimators 
this algorithm ensures that the objective function descends at each iteration. 
This need not happen with the Rocke estimator, which has a non-monotonic 
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distance 


Figure 1: Bisquare and “optimal” weight functions. 


weight function. Maronna et al. (2006, Section 6.4.4) describe an algorithm 
which ensures attaining a local minimum. 

The (approximate) MYE is computed with 1000 subsamples and using the 
improvement described in (Maronna et al., 2006, Section 6.7.3). 

Since in all cases we attempt to minimize a non-convex function, the initial 
estimator is an essential part of the procedure. The standard way to obtain 
a robust and equivariant starting point is subsampling. However, ensuring a 
high enough breakdown point with large p may require an unpractically large 
number of subsamples. Besides, our experiments indicate that the breakdown 
point may be much lower than expected when n/p is “small” (say, < 5), which 
is not uncommon with high-dimensional data sets. For these reasons we need a 
faster and more reliable starting point. 

Pena and Prieto (2007) proposed an equivariant and semi-deterministic pro¬ 
cedure for outlier detection, based on finding directions that maximize or min¬ 
imize the kurtosis of the respective projections, plus a set of random ‘“specific 
directions” aimed at detecting outliers. Here we employ this procedure (which 
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they call “kurtosis plus specific directions”, henceforth abbreviated as “KSD”) 
as an estimator by itself. In the present setting it would not be competitive with 
the other estimators because its efficiency cannot be tuned (see Table [5] below), 
but we shall use it as an initial estimator competing with the sampling-based 
MVE. 

There are no theoretical results on the breakdown point of KSD. However, 
the simulations in (Pena and Prieto 2007, table 4) suggest that it can yield 
reliable results even with 40% of outliers. A limited theoretical result is given 
in Section [TT| 

7.1 Computing the Rocke Estimator 

Given a starting point, the Rocke estimator is computed iteratively as described 
in Section 9.6.3 of (Maronna et al., 2006). 

The form of the weight function ensures that for normal data sets, most 
of the data have positive weights. Since real data are seldom normal, it may 
happen that for data sets with large p and low ratio n/p the proportion of data 
with positive weights is small. If the data set is nearly collinear, this may cause 
X to be ill-conditioned, which affects the computation of Mahalanobis distances. 
For this reason, if at the first iteration the number of data with positive weights 
is less than 2 p, the tuning constant is enlarged until this number is > 2 p. 


8 Simulation 

As a reference distribution we take the p -variate normal N p (po, So) ■ In order 
to measure the performance of a given estimator we need a measure of 

“distance” between an estimator and the true value. Recall that the Kullback- 
Leibler divergence between densities /i and / 2 is 

d K L (/i, h) = J log ) h (z) dz. 

If both densities belong to the same parametric model with parameter vector 
9: fj (z) = / (z, 9j) , then d K l induces a “distance” between parameters: 

D(e 1 ,e 2 ) = dKL(f(. 1 e 1 )j(.,e 1 )). 

In the normal family, for p with known E we have 

D = (p - po)'(p - p 0 ), (19) 

and for E with known p we have 

D = trace — log |E ( (' 1 E| — p (20) 
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Since all estimators are equivariant we may in the simulations take without 
loss of generality (po, £ 0 ) = (0,1). 

Each estimator is evaluated by D = Monte Carlo average of the Kullback- 
Leibler divergences D given in (TT9li - (f20l) . 

We generate N = 500 samples X = [x 4 •] of size n from N p (0,1). 

The estimators compared are: 

• Rocke with tuning constant a; see © 

• MM with bisquare and “optimal” p, with tuning constant c; see m 

• r with bisquare and “optimal” p, with tuning constant c; see (1131) 

• Stahel-Donoho with weight function W(r) = W op t(r/c) where W op t is 
defined in ijlSl) 

For all estimators we employed both the MVE and KSD estimators as start¬ 
ing values. The tuning constants were chosen to attain an efficiency of 0.9 (see 
below). 

We add for completeness four other estimators with uncontrollable efficiency: 

• The S-estimator (S-E) with S = 0.5 in 0 and biweight p. The “optimal” 
p yielded similar results. 

• The MVE and KSD estimators. 

• The MCD with breakdown point 0 and one-step reweighting, computed 
with the code in the library LIBRA (Verboven and Hubert 2005). 

All scatter estimators are corrected for “size” by means of 0, except for 
the MCD for which the code applies a consistency correction. 

8.1 No contamination 

Call C the sample covariance matrix. For each estimator X we define 


D{ C) 


efficiency 



The constants for each estimator are chosen to attain finite-sample efficien¬ 
cies of 0.90. To this end we computed for each estimator its tuning constants 
for n = Kp with I\ = 5, 10 and 20 and p between 5 and 50, and then fitted the 
constants as functions of n and p. 

The simulation showed the efficiency cannot be controlled in all cases, namely 

• For p = 15 the maximum efficiency of the Rocke estimator is 0.876 for all 
as, and is still lower for smaller p. The explanation is that when a tends 
to zero, the estimator does not tend to the covariance matrix unless p is 
large enough. 
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p 

n 

Scatter 

Location 

10 

50 

0.40 

0.62 


100 

0.70 

0.85 


200 

0.86 

0.95 

20 

100 

0.44 

0.62 


200 

0.80 

0.89 


400 

0.90 

0.95 

50 

250 

0.47 

0.58 


500 

0.82 

0.85 


1000 

0.93 

0.96 


Table 2: Efficiencies of the KSD estimator 


• The minimum efficiency of the r-estimators over all constants c tends to 
one with increasing p , for both p— functions. In particular, it is >0.95 for 
p > 50. The reason is that when c is small, the r—scale approaches the 
M-scale, and therefore the r—estimators approaches the S-estimator. 

Table [2] shows the efficiencies of the KSD estimator. 


It seen that the efficiency depends heavily on the ratio n/p and can be rather 
low for n/p = 5. 

8.2 Contamination 

We deal first with shift contamination. For contamination rate e, let to = [ne]. 
Given K , we replace the first coordinate: 


Xu <— 7®ii + K, i = 1,..., to 

The outlier size I\ is varied between 1 and 12 in order to find the maximum 
D. The constant 7 determines the scatter of the outliers. We employed the 
values e = 0.1 and 0.2, and 7 = 0 and 0.5. 

The simulations were run for p = 5, 10, 15, 20 and 30, and n = mp with 
to = 5, 10 and 20. Since the complete results are rather bulky, they are given 
in Section lll.il Here we give the most important conclusions from them. Ex¬ 
amination of the tables shows that 

• The price paid for the high efficiency of S-E is a large loss of robustness. 

• KSD is always better than MVE as a starting estimator for MM and r. 

• KSD is generally better than subsampling for S-D. 

• The “optimal” p is always better than the bisquare p for both MM and r 
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p 

£ 

MM 

r 

Rocke 

S-D 

S-E 

MCD 

5 

0.1 

0.85 

0.89 

1.26 

0.99 

1.09 

1.99 


0.2 

2.27 

2.46 

3.74 

4.53 

4.38 

17.58 

10 

0.1 

1.67 

1.77 

1.53 

1.61 

3.54 

6.66 


0.2 

3.88 

4.53 

1.67 

7.94 

11.26 

21.89 

15 

0.1 

2.38 

2.98 

1.95 

2.26 

6.68 

12.53 


0.2 

5.68 

7.85 

4.47 

12.31 

19.82 

28.33 

20 

0.1 

3.32 

4.59 

2.49 

3.00 

10.03 

16.46 


0.2 

7.90 

12.62 

3.17 

17.09 

25.41 

32.04 

30 

0.1 

5.34 

8.56 

3.03 

4.64 

18.39 

17.66 


0.2 

14.21 

20.71 

5.61 

29.66 

49.14 

34.02 


Table 3: Maximum mean Ds of scatter matrices, for n = lOp and 7 = 0 . All 
estimators start from KSD. MM and r use “optimal” p. 

• In all situations, the best estimators are MM and r with “optimal” p, 
Rocke, and S-D, all starting from KSD. 

• Although the results for 7 = 0 and 0.5 are different, the comparisons 
among estimators are almost the same. 

• The relative performances of the estimators for location and scatter are 
similar. 

• The relative performances of the estimators for n = 5 p, 10 p and 20 p are 
similar. 

For these reasons we give in Table [3] a reduced version of the results, for n = 
10 p and 7 = 0 , and the maximum Ds of the scatter estimators corresponding 
to MM and r (both with “optimal” p), Rocke and S-D, all starting from KSD. 
For completion we add S-E with KSD start, and the reweighted MCD. 

The results for estimators with efficiency less than 0.9 are shown in italics. 


It is seen that 

• The performance of S-D is competitive for e = 0.1, but is poor for e = 0.2. 

• For p < 10, MM has the best overall performance. 

• For p > 15, Rocke has the best overall performance. 

• The MCD has a poor performance. 

Figure [2] shows the values of D as a function of the outlier size K for some of 
the estimators in the case p = 20, n = 200 and 7 = 0 . Here “MM-Opt” stands 
for “MM with ’optimal’ p". All estimators in the second panel start from KSD. 
The plot confirms the superiority of Rocke+KSD. 
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Figure 2: Mean D of of scatter estimators for p = 20, n = 200, e = 0.1 and 
7 = 0 as a function of the outlier size K. All iterative estimators start from 
KSD. 


8.3 Comparison with a non-equivariant estimator 

Recently Hubert et al, (2015) proposed two deterministic estimators, called DetS 
and DetMM, of which the latter has a tuneable efficiency. We compare it with 
Rocke+SD. The nominal efficiency of DetMM is chosen as 0.90. The scenario 
is the same as above. However, since DetMM is not equivariant, the model 
is now N p (0, S 0 ) where So has unit diagonal elements and all non-diagonal 
elements equal to p. We chose the extreme cases p = 0 and = 0.9. Since both 
yield qualitatively similar results, we show in Tabled] only the results from the 
first case. 
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Scatter 



Location 




n = 

100 

200 

400 

100 

200 

400 

0.1 

0 

Rocke+KSD 

3.96 

2.41 

1.63 

0.39 

0.27 

0.19 



DetMM 

26.97 

26.59 

26.19 

5.93 

5.72 

5.32 


0.5 

Rocke+KSD 

4.73 

2.95 

2.38 

0.54 

0.41 

0.36 



DetMM 

18.01 

17.89 

18.99 

3.46 

3.29 

3.18 

0.2 

0 

Rocke+KSD 

10.62 

5.22 

3.58 

1.47 

0.73 

0.52 



DetMM 

213.66 

164.18 

156.82 

81.29 

78.19 

77.50 


0.5 

Rocke+KSD 

12.08 

9.24 

8.67 

2.33 

1.95 

1.90 



DetMM 

118.79 

111.87 

109.79 

46.96 

46.90 

45.81 


Table 4: Comaprison of Rocke and DetMM estimators: Maximum mean D for 

P = o 


P 

n 

Rocke+MVE 

Rocke+KSD 

DetMM 

20 

100 

0.62 

0.06 

0.20 


200 

0.98 

0.079 

0.30 


400 

1.31 

0.15 

0.54 

50 

250 

5.03 

0.51 

1.61 


500 

6.54 

1.22 

3.11 


1000 

12.72 

3.07 

6.51 

80 

400 

14.55 

6.43 

5.97 


800 

22.46 

14.90 

12.23 


1600 

65.45 

22.48 

26.64 

100 

500 

26.86 

59.18 

11.79 


1000 

74.01 

91.63 

24.47 


2000 

152.06 

113.54 

47.41 


Table 5: Mean computing times of estimators in seconds 


The performance of DetMM is clearly poor. We have not been able to find 
an explanation for this disappointing behavior. 

8.4 Computing times 

We compare the computing times of the Rocke estimator with MVE and KSD 
starts, and of DetMM. The results are the average of 10 runs with normal 
samples, on a PC with Intel TM12 Duo CPU and 3.01 GHz. The values of n 
were 5 p, lOp and 20p, with p between 10 and 100. The number of subsamples 
for the MVE was made to increase slowly as 50p. Table [5] displays the results, 
where for brevity we show only the values for p = 20, 50, 80 and 100. 


It is seen that Rocke+KSD is faster than DetMM for p < 80, and Rocke+MVE 
. However it is slower than DetMM for p = 100. This rapid increase in comput- 
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ing time is probably due to the optimization procedure employed by KSD, and 
may be improved upon by choosing a more efficient optimizer. 


9 A real example 

We deal with the well-known wine data set, available at the UCI machine learn¬ 
ing repository: https://archive.ics.uci.edu/ml/datasets/Wine, which has been 
employed as a benchmark data set for pattern recognition; see e. g. (Aeberliard 
et al, 1994), and consists of three classes with 13 variables. The estimators 
were applied to the data of class 3, with n = 48 and p = 13. Since KSD and 
MVE yielded similar results as initial estimators, we show only the results cor¬ 
responding to the former. Figures [3] and [4] contain the QQ-plots of the (squared) 
Mahalanobis distances for the different estimators. 


Rocke + KSD MM + KSD 


200 

+ 

80 

-■---1- 

150 

++ 

60 

,++ 

100 

++ 

40 

H- 

+ 


-H- 


+ 

50 

~r 

20 


0 


0 



O'--*- O'---*-*- 

0 10 20 30 0 10 20 30 


DetMM 


S-D 




Figure 3: Wine data: Ordered Mahalanobis distances vs. y^-quantiles. 
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Covariance S-E + KSD 




0 10 20 30 


Figure 4: Wine data: Wine data: Ordered Mahalanobis distances vs. \p~ 
quantiles. 


Rocke, MM and DetMM pinpoint respectively 8, 6 and 5 possible outliers; 
S-D seems to pinpoint an excessive number of possible outliers; while S-E and r 
behave like the classical estimator, showing no suspicious points. Some subject- 
matter knowledge would be necessary to decide how atypical the suspicious 
points are. 


10 Conclusions 

The Rocke estimator has a controllable efficiency for p > 15. With equal effi¬ 
ciencies, the Rocke estimator with KSD start outperforms all its competitors 
for shift contamination Its computing time is competitive for p < 100, and can 
probably be improved upon. It can therefore be recommended for estimation 
with p > 15. 
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For p < 15 we can recommend MM with “optimal” p and KSD start. 
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11 Appendix 

11.1 Simulation results 

Tables [6] to [15] in this section contain the detailed results of the simulation. In 
each scenario, the smallest and next-to-smallest values are marked as bold and 
italic, respectively. This is done only for estimators with controllable efficiency; 
in particular, Rocke is not considered for p < 10, for its efficiency in this case is 
less than 0.90. 

One would expect the values for a given estimator to decrease when n in¬ 
creases. However, in many cases this does not hold for the estimators based 
on the MVE. We have re-run the simulations with a different seed, and also 
employed medians instead of means to rule out atypical cases, but this pattern 
appears nevertheless. We have not been able to find an explanation for this 
phenomenon. Since it always affects the largest values, it does not influence the 
conclusions. 


11.2 Approximations for the tuning constants 

For the MM estimator with MVE start, the constant c in (fl5l) is approximated 



where the coefficients depend on the p —function as follows: 


p 

a 

b 

c 

d 

e 

Bisquare 

0.540 

3.538 

-7.505 

1.114 

-0.968 

Optimal 

0.469 

3.158 

-0.928 

1.167 

-1.698 


For the MM estimator with KSD start, c is approximated by 


c 

b 

= a + - 
P 

1 P 

+ C—, 

n 


p 

a 

b 

c 

Bisquare 

0.716 

2.572 

-0.786 

Optimal 

0.612 

4.504 

-1.112 
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For the Rocke estimator, the value of a in m which yields 90% efficiency 
is approximated for p > 15 by 


a = ap b n c 


with 


Start 

a 

b 

c 

MVE 

0.00436 

-0.5030 

0.4214 

KSD 

0.00216 

-1.0078 

0.8156 


For the r-estimator the constant c in m which yields 90% efficiency is 
approximated for both MVE and KSD starts by 

c = ap b (21) 

with 


P 

a 

b 

Bisquare 

6.2984 

-0.8458 

Optimal 

2.9987 

-0.4647 


Finally, for the Stahel-Donoho estimator the weight function is W (r) = 
W op t(d/c ) where the values of c that yield 90% efficiency are given by 


c 

b 

— CL H— 

n 

1 P 

+ c—, 
n 



a 

b 

c 

Subsampling 

5.116 

63.820 

2.213 

KSD 

6.564 

0.211 

24.286 


Here, 9c is the cutoff point(the value at which W vanishes). 


11.3 The breakdown point of the KSD estimator 

The KSD procedure is defined in the same way as the Stahel-Donoho estimator, 
but with a different set of directions. The population version is as follows. Let 
x be a random vector with distribution F. Let U C R p be a set of directions u 
with ||u|| = 1. Let p and a be univariate robust location and scale estimators. 
The outlyingness of a point z gR p is defined as 


OL (z) 


= max 
uej/ 


|u'z -p F (u'x) | 
<JF (u'x) 


The location and scale estimators are defined as weighted means and covariance 
matrix with weights W (OL (x)) where W(t) > 0 is a nonincreasing function for 
t > 0. 
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For a sample, the estimator is defined as above with F the empirical distribu¬ 
tion. In the (theoretical) Stahel-Donoho estimator, U is the set of all directions, 
and IF is a smooth function; in actual practice, a finite set of directions obtained 
by subsampling is employed. 

The KSD procedure employs two sets of directions: U = U\ U t/ 2 . The first 
one is deterministic, and consists of a set of p orthogonal directions maximizing 
the kurtosis of u'x and p directions minimizing it. The other is a set of random 
“specific directions” obtained through a stratified sampling. We shall deal only 
with the first one. Besides, W is of “hard rejection” type: W{t) = 1 (t < f$) 
where /3 depends on p. 

Theoretical calculations with KSD seem extremely difficult, and for this 
reason we will limit ourselves to a very simplified case. We consider only the 
population case with point-mass contamination; furthermore we assume that 
the uncontaminated data are elliptically distributed. It will be shown that if p 
and cr have breakdown 0.5, so has the KSD estimator. 

Let Fo be an elliptical distribution with fourth moments and consider the 
contaminated distribution F = (1 — e) F 0 + eS Xo with e < 0.5. Because of the 
estimator’s equivariance it may be assumed that Fq is radial, with zero means 
and identity covariance matrix, and that Xq = K bi where bj are the elements of 
the canonical base and K > 0. Put A = Ef 0 x 4 , where x\ is the first coordinate of 
x. The rotational symmetry implies that A = Ef 0 (u'x) 4 for all u = (ui ,..., u p )' 
with ||u|| = 1. 

We will show that the direction of the contamination, i.e., u = bi, is always 
included in the set. It is straightforward to show that the kurtosis of a projection 
u'x under F is 


Kurtf (u'x) 


where s = u\ and 


a + bsK 2 + cs 2 K 4 
9(>) =: (1 + e.IO) ’ 


( 22 ) 


a = (1 — e) A, b = 6 (1 — e) s 2 , c = e (l — 4e + 6e 2 — 3e 3 ) 

It follows that Kurt f (u'x) depends on u only through s = u\ £ [0,1]. A 
laborious but straightforward calculation shows that the derivative of g (s) has 
the form g' (s) — u{s)v (s), where u (s) > 0 does not depend on K , and 


v ( s) = (1 — e) (3e — A) + sK 2 (l — 4e + 3 2 ) . 


The location of the extrema depends only on the sign of v. Although the 
result holds in general, to simplify the analysis we consider only the case A > 1.5. 
and we assume K 2 > A. There are two cases. If e > 1/3, then v (s) < 0 for 
s G [0, 1], and therefore u = bi is a minimizer of Kurtp (u'x). If e < 1/3 there 
are maxima at s = 1 and s = 0, and therefore the set of maximizing directions 
contains bi and a set of orthogonal u’s which are orthogonal to bi 
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It follows that 


OL (x 0 ) > 


\K- f-i(xi) | 

cr(xi) 


Note that fi (a’i) and a (x\) depend on K, but since £ < 0.5 they are bounded. 
Therefore for K large enough, OL (xo) will be larger than the cutoff value /? 
and will therefore have null weight. This finishes the proof. 
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Start 


Scatter 



Location 


0 

n = 


25 

50 

100 

25 

50 

100 


S-E 

MVE 

1.24 

1.21 

0.98 

0.28 

0.27 

0.24 


S-E 

KSD 

1.27 

1.09 

0.90 

0.28 

0.27 

0.24 


MM-Bisq. 

MVE 

1.26 

1.28 

1.19 

0.28 

0.28 

0.24 


MM-Bisq. 

KSD 

1.26 

1.27 

1.24 

0.29 

0.28 

0.25 


MM-Opt. 

MVE 

1.29 

0.87 

0.74 

0.29 

0.24 

0.21 


MM-Opt. 

KSD 

1.26 

0.85 

0.72 

0.30 

0.24 

0.21 


r-Bisq. 

MVE 

1.43 

1.60 

1.44 

0.33 

0.31 

0.26 


r-Bisq. 

KSD 

1.43 

1.55 

1.37 

0.32 

0.30 

0.26 


r-Opt. 

MVE 

1.24 

0.95 

0.76 

0.29 

0.25 

0.22 


r-Opt. 

KSD 

1.27 

0.89 

0.75 

0.29 

0.25 

0.21 


Rocke 

MVE 

2.28 

1.49 

1.01 

0.37 

0.30 

0.23 


Rocke 

KSD 

2.64 

1.26 

0.70 

0.39 

0.25 

0.19 


S-D 

Subs. 

1.12 

0.90 

0.68 

0.28 

0.22 

0.17 


S-D 

KSD 

1.26 

0.99 

0.79 

0.29 

0.23 

0.18 


KSD 


3.93 

1.22 

0.45 

0.43 

0.24 

0.15 


MCD 


3.31 

1.99 

1.17 

0.47 

0.42 

0.29 


MVE 


4.52 

3.27 

2.11 

0.56 

0.45 

0.36 

0.5 

S-E 

MVE 

1.11 

0.91 

0.72 

0.27 

0.22 

0.17 


S-E 

KSD 

1.13 

0.84 

0.64 

0.27 

0.22 

0.17 


MM-Bisq. 

MVE 

1.14 

1.08 

0.93 

0.27 

0.23 

0.18 


MM-Bisq. 

KSD 

1.16 

1.05 

0.98 

0.27 

0.23 

0.19 


MM-Opti 

MVE 

1.22 

0.94 

0.75 

0.29 

0.25 

0.21 


MM-Opti 

KSD 

1.28 

0.96 

0.80 

0.30 

0.25 

0.21 


r-Bisq. 

MVE 

1.31 

1.34 

1.11 

0.28 

0.26 

0.21 


r-Bisq. 

KSD 

1.30 

1.31 

1.08 

0.28 

0.26 

0.21 


r-Opti 

MVE 

1.21 

0.99 

0.75 

0.29 

0.25 

0.21 


r-OPti 

KSD 

1.24 

0.88 

0.74 

0.30 

0.24 

0.21 


Rocke 

MVE 

2.10 

1.16 

0.74 

0.35 

0.26 

0.21 


Rocke 

KSD 

2.45 

1.09 

0.68 

0.36 

0.25 

0.18 


S-D 

Subs. 

1.05 

0.77 

0.55 

0.26 

0.20 

0.14 


S-D 

KSD 

1.15 

0.80 

0.59 

0.28 

0.21 

0.16 


KSD 


3.74 

1.07 

0.64 

0.41 

0.22 

0.18 


MCD 


3.04 

1.33 

0.61 

0.44 

0.28 

0.17 


MVE 


4.12 

2.74 

1.81 

0.51 

0.43 

0.34 


Table 6: Simulation: Maximum mean D for scatter and location with p = 5 
and 10% contamination 


24 



7 


Start 


Scatter 



Location 


0 

n = 


25 

50 

100 

25 

50 

100 


S-E 

MVE 

9.10 

6.02 

6.23 

2.63 

2.15 

2.34 


S-E 

KSD 

5.99 

4.38 

4.26 

1.93 

1.70 

1.87 


MM-Bisq. 

MVE 

7.83 

5.21 

5.43 

2.29 

1.86 

2.03 


MM-Bisq. 

KSD 

5.11 

4.03 

4.25 

1.65 

1.43 

1.57 


MM-Opti 

MVE 

7.90 

3.89 

3.79 

2.05 

1.27 

1.31 


MM-Opti 

KSD 

3.90 

2.27 

2.05 

1.17 

0.83 

0.80 


r-Bisq. 

MVE 

8.25 

6.35 

6.48 

2.46 

2.15 

2.25 


r-Bisq. 

KSD 

7.02 

5.67 

5.50 

2.15 

1.92 

2.06 


r-Opti 

MVE 

7.56 

4.01 

3.86 

1.98 

1.29 

1.33 


r-Opti 

KSD 

4.18 

2.46 

2.04 

1.24 

0.86 

0.76 


Rocke 

MVE 

13.3 

6.76 

5.94 

2.25 

1.54 

1.61 


Rocke 

KSD 

8.23 

3.74 

2.13 

1.45 

0.84 

0.64 


S-D 

Subs. 

5.49 

4.30 

3.86 

1.38 

1.22 

1.19 


S-D 

KSD 

12.35 

4.53 

3.95 

2.30 

1.22 

1.14 


KSD 


11.55 

5.43 

2.13 

1.96 

1.17 

0.69 


MCD 


19.68 

17.58 

16.16 

8.07 

7.93 

6.87 


MVE 


18.78 

8.99 

5.74 

3.25 

1.81 

1.26 

0.5 

S-E 

MVE 

5.43 

4.49 

4.20 

1.56 

1.42 

1.36 


S-E 

KSD 

4.50 

3.23 

3.08 

1.49 

1.28 

1.34 


MM-Bisq. 

MVE 

5.06 

4.44 

4.57 

1.49 

1.45 

1.53 


MM-Bisq. 

KSD 

4.79 

4.21 

4.48 

1.46 

1.40 

1.52 


MM-Opti 

MVE 

4.30 

3.17 

2.79 

1.25 

1.05 

1.06 


MM-Opti 

KSD 

3.95 

2.65 

2.60 

1.23 

1.01 

1.04 


r-Bisq. 

MVE 

6.45 

5.60 

5.39 

1.87 

1.72 

1.67 


r-Bisq. 

KSD 

6.16 

5.38 

5.10 

1.85 

1.65 

1.64 


r-Opti 

MVE 

4.36 

3.22 

2.76 

1.30 

1.07 

1.07 


r-Opti 

KSD 

3.72 

2.63 

2.63 

1.17 

0.97 

1.02 


Rocke 

MVE 

6.89 

4.60 

3.84 

1.65 

1.34 

1.37 


Rocke 

KSD 

5.84 

2.70 

1.72 

1.22 

0.87 

0.73 


S-D 

Subs. 

4.13 

3.03 

2.60 

1.05 

0.86 

0.77 


S-D 

KSD 

4.83 

3.90 

3.80 

1.11 

0.98 

0.96 


KSD 


6.88 

2.62 

1.59 

1.21 

0.84 

0.69 


MCD 


5.35 

3.34 

2.56 

1.81 

1.43 

1.62 


MVE 


9.67 

7.07 

5.78 

2.11 

1.92 

1.86 


Table 7: Simulation: Maximum mean D for scatter and location with p = 5 
and 20% contamination 
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Start 

Scatter 


Location 

0 

n = 


50 

100 

200 

50 

100 

200 


S-E 

MVE 

4.32 

3.82 

3.59 

0.76 

0.69 

0.67 


S-E 

KSD 

4.53 

3.54 

3.21 

0.75 

0.64 

0.62 


MM-Bisq. 

MVE 

3.56 

3.23 

3.16 

0.66 

0.63 

0.65 


MM-Bisq. 

KSD 

3.59 

3.18 

2.98 

0.65 

0.62 

0.60 


MM-Opt. 

MVE 

2.62 

1.78 

1.54 

0.44 

0.34 

0.32 


MM-Opt. 

KSD 

2.52 

1.67 

1.45 

0.43 

0.33 

0.31 


r-Bisq. 

MVE 

4.32 

3.56 

3.29 

0.76 

0.66 

0.66 


r-Bisq. 

KSD 

4.27 

3.33 

3.19 

0.75 

0.65 

0.65 


r-Opt. 

MVE 

2.88 

1.85 

1.54 

0.48 

0.35 

0.32 


r-Opt. 

KSD 

2.64 

1.77 

1.52 

0.46 

0.35 

0.32 


Rocke 

MVE 

3.07 

1.85 

1.49 

0.45 

0.31 

0.28 


Rocke 

KSD 

4.05 

1.53 

1.03 

0.43 

0.26 

0.21 


S-D 

Subs 

2.70 

1.80 

1.45 

0.45 

0.32 

0.28 


S-D 

KSD 

3.04 

1.61 

1.04 

0.48 

0.29 

0.24 


KSD 


6.76 

2.40 

0.98 

0.63 

0.37 

0.22 


MCD 


10.95 

6.66 

5.28 

1.87 

1.20 

0.94 


MVE 


6.49 

3.81 

2.68 

0.68 

0.41 

0.32 

0.5 

S-E 

MVE 

4.19 

3.33 

3.12 

0.66 

0.55 

0.51 


S-E 

KSD 

3.99 

3.05 

2.79 

0.65 

0.53 

0.5 


MM-Bisq 

MVE 

3.73 

2.76 

2.58 

0.61 

0.5 

0.47 


MM-Bisq. 

KSD 

3.41 

2.73 

2.50 

0.57 

0.49 

0.46 


MM-Opt. 

MVE 

3.27 

2.04 

1.57 

0.52 

0.36 

0.33 


MM-Opt. 

KSD 

2.74 

1.77 

1.51 

0.49 

0.35 

0.32 


r-Bisq. 

MVE 

4.01 

3.11 

2.68 

0.65 

0.51 

0.47 


r-Bisq. 

KSD 

3.80 

2.86 

2.56 

0.63 

0.53 

0.47 


r-Opt. 

MVE 

3.25 

2.16 

1.73 

0.54 

0.39 

0.32 


r-Opt. 

KSD 

2.75 

1.76 

1.50 

0.49 

0.35 

0.32 


Rocke 

MVE 

3.94 

2.14 

1.72 

0.61 

0.39 

0.36 


Rocke 

KSD 

3.99 

1.67 

1.34 

0.51 

0.30 

0.28 


S-D 

Subs 

2.35 

1.45 

1.09 

0.40 

0.26 

0.21 


S-D 

KSD 

2.68 

1.74 

1.34 

0.42 

0.29 

0.22 


KSD 


5.55 

1.63 

1.25 

0.58 

0.32 

0.26 


MCD 


5.43 

2.70 

1.72 

0.79 

0.51 

0.37 


MVE 


8.31 

4.79 

3.73 

0.99 

0.67 

0.61 


Table 8: Simulation: Maximum mean D for scatter and location with p = 10 
and 10% contamination 
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Start 


Scatter 



Location 


0 

n = 


50 

100 

200 

50 

100 

200 


S-E 

MVE 

15.20 

12.83 

12.58 

5.12 

4.87 

5.13 


S-E 

KSD 

13.65 

11.26 

10.81 

4.2 

4.03 

4.25 


MM-Bisq. 

MVE 

14.88 

12.15 

11.86 

4.86 

4.58 

4.76 


MM-Bisq. 

KSD 

9.69 

8.01 

8.29 

3.31 

3.05 

3.30 


MM-Opt. 

MVE 

11.67 

8.72 

8.28 

3.01 

2.51 

2.45 


MM-Opt. 

KSD 

6.59 

3.88 

3.52 

1.83 

1.21 

1.18 


r-Bisq. 

MVE 

15.76 

13.13 

13.13 

5.38 

5.19 

5.45 


r-Bisq. 

KSD 

12.32 

10.47 

10.58 

4.41 

4.21 

4.56 


r-Opt. 

MVE 

11.87 

8.81 

8.32 

3.06 

2.53 

2.47 


r-Opt. 

KSD 

7.26 

4.53 

4.04 

1.95 

1.41 

1.33 


Rocke 

MVE 

11.94 

8.27 

7.49 

2.05 

1.68 

1.64 


Rocke 

KSD 

9.40 

4.51 

2.59 

1.44 

0.84 

0.65 


S-D 

Subs 

12.20 

10.07 

9.08 

3.13 

2.88 

2.89 


S-D 

KSD 

34.59 

7.94 

7.09 

6.02 

2.35 

2.18 


KSD 


20.90 

10.96 

8.68 

4.02 

2.57 

2.62 


MCD 


26.45 

21.89 

17.84 

10.50 

9.21 

6.82 


MVE 


19.69 

12.08 

8.11 

3.67 

2.54 

1.72 

0.5 

S-E 

MVE 

25.07 

20.87 

17.18 

7.65 

6.59 

5.64 


S-E 

KSD 

11.44 

10.09 

9.86 

3.49 

3.56 

3.71 


MM-Bisq. 

MVE 

27.04 

22.22 

18.21 

8.42 

7.18 

6.12 


MM-Bisq. 

KSD 

9.35 

7.93 

8.86 

2.95 

2.84 

3.09 


MM-Opt. 

MVE 

23.01 

18.10 

14.64 

6.23 

5.03 

4.21 


MM-Opt. 

KSD 

6.94 

4.76 

4.82 

1.90 

1.58 

1.65 


r-Bisq. 

MVE 

25.78 

21.57 

17.79 

7.94 

6.9 

5.93 


r-Bisq. 

KSD 

11.19 

10.15 

9.91 

3.49 

3.52 

3.56 


r-Opt. 

MVE 

22.05 

17.67 

14.28 

6.01 

4.92 

4.11 


r-Opt. 

KSD 

6.78 

4.85 

4.88 

1.92 

1.61 

1.65 


Rocke 

MVE 

24.89 

19.22 

15.18 

6.01 

4.89 

4.09 


Rocke 

KSD 

9.16 

5.18 

4.45 

1.84 

1.41 

1.44 


S-D 

Subs 

8.91 

6.11 

5.49 

2.14 

1.67 

1.55 


S-D 

KSD 

9.19 

7.18 

6.64 

2.04 

1.72 

1.66 


KSD 


12.45 

7.03 

4.34 

2.70 

1.99 

1.46 


MCD 


22.33 

18.35 

15.85 

9.24 

7.19 

7.09 


MVE 


37.74 

28.83 

20.74 

7.95 

7.81 

6.11 


Table 9: Simulation: Maximum mean D for scatter and location with p = 10 
and 20% contamination 
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Start 


Scatter 



Location 



n = 


75 

150 

300 

75 

150 

300 

0 

S-E 

MVE 

6.93 

6.68 

6.35 

1.03 

1.17 

1.22 


S-E 

KSD 

7.15 

6.68 

6.43 

0.99 

1.16 

1.21 


MM-Bisq. 

MVE 

5.47 

5.65 

5.65 

0.88 

1.13 

1.23 


MM-Bisq. 

KSD 

5.19 

5.17 

5.15 

0.81 

1.04 

1.08 


MM-Opt. 

MVE 

3.83 

2.92 

2.55 

0.50 

0.48 

0.46 


MM-Opt. 

KSD 

3.33 

2.38 

2.25 

0.46 

0.38 

0.41 


r-Bisq. 

MVE 

7.42 

7.91 

7.98 

1.20 

1.56 

1.68 


r-Bisq. 

KSD 

7.93 

7.85 

7.73 

1.68 

1.52 

1.47 


r-Opt. 

MVE 

4.11 

3.23 

2.73 

0.55 

0.59 

0.48 


r-Opt. 

KSD 

3.69 

2.98 

2.63 

0.51 

0.49 

0.48 


Rocke 

MVE 

3.80 

2.58 

2.04 

043 

0.34 

0.31 


Rocke 

KSD 

3.67 

1.95 

144 

0.36 

0.27 

0.23 


S-D 

Subs. 

3.95 

3.08 

2.57 

0.51 

0.49 

0.45 


S-D 

KSD 

3.83 

2.26 

1.42 

0.50 

0.38 

0.30 


KSD 


8.08 

4.19 

1.66 

0.69 

0.53 

0.31 


MOD 


13.71 

12.53 

10.51 

2.78 

2.06 

1.99 


MVE 


8.44 

5.09 

3.19 

0.73 

0.55 

0.32 

0.5 

S-E 

MVE 

7.83 

6.38 

5.73 

0.98 

1.02 

0.93 


S-E 

KSD 

6.59 

5.68 

5.26 

0.86 

0.93 

0.92 


MM-Bisq. 

MVE 

8.61 

5.29 

4.71 

1.09 

0.89 

0.9 


MM-Bisq. 

KSD 

4.84 

4.58 

4.29 

0.71 

0.81 

0.8 


MM-Opt. 

MVE 

7.55 

4.16 

3.33 

0.91 

0.62 

0.55 


MM-Opt. 

KSD 

3.55 

2.79 

2.48 

0.50 

0.46 

0.45 


r-Bisq. 

MVE 

8.37 

6.36 

6.23 

1.06 

1.09 

1.12 


r-Bisq. 

KSD 

6.05 

6.17 

6.17 

0.87 

1.05 

1.11 


r-Opt. 

MVE 

7.36 

4.08 

3.28 

0.89 

0.62 

0.55 


r-Opt. 

KSD 

3.87 

2.93 

2.56 

0.53 

0.51 

0.47 


Rocke 

MVE 

7.69 

4.07 

3.21 

0.92 

0.60 

0.52 


Rocke 

KSD 

4.03 

2.16 

1.68 

0.47 

0.35 

0.31 


S-D 

Subs. 

3.38 

2.44 

1.90 

0.43 

0.38 

0.33 


S-D 

KSD 

3.69 

2.14 

1.31 

0.46 

0.34 

0.24 


KSD 


7.32 

2.66 

1.66 

0.69 

0.43 

0.32 


MOD 


9.10 

7.06 

5.95 

1.30 

1,19 

0.99 


MVE 


15.05 

8.54 

5.92 

1.53 

1.09 

0.86 


Table 10: Simulation: Maximum mean D for scatter and location with p = 15 
and 10% contamination 
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Start 


Scatter 


Location 


n = 


75 

150 

300 

75 

150 

300 

0 

S-E 

MVE 

20.06 

20.23 

20.43 

6.97 

7.91 

8.44 


S-E 

KSD 

23.22 

19.82 

19.59 

6.71 

6.77 

7.3 


MM-Bisq. 

MVE 

14.06 

21.25 

19.92 

8.95 

8.69 

8.87 


MM-Bisq. 

KSD 

15.42 

12.66 

12.15 

5.56 

4.99 

5.02 


MM-Opt 

MVE 

10.75 

13.87 

12.49 

2.54 

3.85 

3.73 


MM-Opt. 

KSD 

10.41 

5.68 

5.10 

2.67 

1.67 

1.68 


r-Bisq. 

MVE 

23.28 

24.58 

24.93 

8.87 

10.66 

11.50 


r-Bisq. 

KSD 

22.59 

21.83 

20.54 

9.72 

9.17 

9.12 


r-Opt. 

MVE 

11.17 

14.42 

12.79 

2.66 

4.02 

3.84 


r-Opt. 

KSD 

11.73 

7.85 

7.52 

3.01 

2.35 

2.46 


Rocke 

MVE 

8.81 

10.3 

8.68 

1.27 

1.80 

1.67 


Rocke 

KSD 

9.96 

4.47 

3.31 

1.40 

0.76 

0.68 


S-D 

Subs. 

23.84 

21.68 

21.09 

6.90 

6.68 

6.99 


S-D 

KSD 

98.01 

12.31 

10.80 

12.54 

3.97 

3.74 


KSD 


31.92 

15.3 

14.12 

6.59 

4.02 

4.57 


MCD 


35.14 

28.33 

20.25 

13.28 

10.30 

8.06 


MVE 


21.93 

26.67 

19.41 

4.15 

7.51 

6.03 

0.5 

S-E 

MVE 

26.43 

26.94 

27.11 

8.41 

9.15 

9.54 


S-E 

KSD 

19.51 

17.59 

17.4 

5.58 

5.83 

6.23 


MM-Bisq. 

MVE 

31.22 

31.82 

31.98 

10.73 

11.86 

12.44 


MM-Bisq. 

KSD 

13.24 

12.28 

13.22 

4.85 

4.64 

4.33 


MM-Opt. 

MVE 

24.66 

24.07 

23.96 

6.64 

6.87 

7.08 


MM-Opt. 

KSD 

10.10 

7.59 

8.04 

2.57 

2.22 

2.49 


r-Bisq. 

MVE 

29.98 

31.16 

31.59 

10.20 

11.54 

12.24 


r-Bisq. 

KSD 

19.21 

17.72 

17.29 

6.55 

6.59 

6.97 


r-Opt. 

MVE 

23.81 

23.66 

23.52 

6.45 

6.75 

6.94 


r-Opt. 

KSD 

10.31 

8.21 

8.56 

2.72 

2.38 

2.68 


Rocke 

MVE 

22.34 

21.14 

20.61 

5.04 

5.10 

5.17 


Rocke 

KSD 

11.17 

6.82 

6.70 

1.97 

1.77 

1.68 


S-D 

Subs. 

17.25 

13.76 

12.49 

4.66 

3.68 

3.34 


S-D 

KSD 

13.88 

10.50 

9.55 

3.26 

2.67 

2.53 


KSD 


20.76 

14.62 

7.87 

4.75 

4.43 

2.36 


MCD 


30.80 

22.85 

17.43 

11.54 

6.03 

5.13 


MVE 


39.66 

37.56 

30.43 

7.38 

9.86 

8.53 


Table 11: Simulation: Maximum mean D for scatter and location with p = 15 
and 20% contamination 
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Start 


Scatter 



Location 



n = 


100 

200 

400 

100 

200 

400 

0 

S-E 

MVE 

12.30 

11.03 

10.69 

1.91 

1,79 

1.71 


S-E 

KSD 

10.78 

10.03 

10.06 

1.86 

1.72 

1.64 


MM-Bisq. 

MVE 

8.82 

8.14 

7.93 

1.83 

1.72 

1.65 


MM-Bisq. 

KSD 

7.71 

7.41 

7.36 

1.60 

1.59 

1.50 


MM-Opt. 

MVE 

5.78 

4.16 

3.52 

0.66 

0.59 

0.56 


MM-Opt. 

KSD 

U5 

3.32 

2.77 

0.54 

0.49 

0.47 


r-Bisq. 

MVE 

14.23 

14.11 

13.65 

2.98 

2.94 

2.68 


r-Bisq. 

KSD 

13.58 

12.93 

12.56 

2.95 

2.68 

2.52 


r-Opt. 

MVE 

6.42 

5.08 

4.58 

0.80 

0.74 

0.73 


r-Opt. 

KSD 

5.91 

4.59 

4.10 

0.74 

0.68 

0.68 


Rocke 

MVE 

4.81 

2.90 

1.80 

0.45 

0.30 

0.17 


Rocke 

KSD 

4.01 

2.49 

1.14 

0.39 

0.28 

0.11 


S-D 

Subs 

6.43 

4.81 

4.07 

0.79 

0.71 

0.66 


S-D 

KSD 

5.73 

3.00 

1.80 

0.71 

0.46 

0.34 


KSD 


10.47 

6.69 

2.35 

1.63 

1.52 

1.49 


MCD 


23.11 

16.46 

12.85 

3.70 

2.64 

1.93 


MVE 


11.01 

6.20 

3.79 

0.82 

0.55 

0.37 

0.5 

S-E 

MVE 

15.00 

9.78 

9.02 

1.98 

1.50 

1.49 


S-E 

KSD 

9.97 

9.21 

9.05 

1.48 

1.43 

1.41 


MM-Bisq. 

MVE 

17.17 

11.07 

8.19 

2.65 

1.89 

1.56 


MM-Bisq. 

KSD 

7.12 

6.39 

6.36 

1.18 

1.12 

1.10 


MM-Opt. 

MVE 

13.26 

8.22 

5.86 

1.67 

1.11 

0.88 


MM-Opt. 

KSD 

5.13 

3.46 

3.16 

0.64 

0.53 

0.52 


r-Bisq. 

MVE 

16.97 

11.95 

10.82 

2.61 

2.09 

2.02 


r-Bisq. 

KSD 

11.14 

9.84 

9.62 

1.81 

1.76 

1.70 


r-Opt. 

MVE 

13.11 

8.16 

5.84 

1.66 

1.11 

0.88 


r-Opt. 

KSD 

5.75 

4.41 

3.92 

0.73 

0.66 

0.65 


Rocke 

MVE 

12.08 

7.27 

5.01 

1.41 

0.90 

0.68 


Rocke 

KSD 

4.77 

3.23 

2.28 

0.55 

0.43 

0.36 


S-D 

Subs 

5.57 

3.82 

3.03 

0.65 

0.52 

0.48 


S-D 

KSD 

5.20 

2.47 

1.37 

0.58 

0.37 

0.27 


KSD 


10.45 

3.78 

2.55 

1.63 

1.53 

1.49 


MCD 


20.94 

15.20 

12.45 

3.41 

2.42 

1.86 


MVE 


23.32 

13.14 

8.03 

2.29 

1.51 

1.01 


Table 12: Simulation: Maximum mean D for scatter and location with p = 20 
and 10% contamination 
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Start 


Scatter 


Location 


n = 


100 

200 

400 

100 

200 

400 

0 

S-E 

MVE 

36.13 

35.50 

32.45 

13.83 

12.71 

11.60 


S-E 

KSD 

27.25 

25.41 

25.38 

12.24 

11.95 

10.64 


MM-Bisq. 

MVE 

40.38 

38.61 

34.9 

17.34 

16.76 

16.20 


MM-Bisq. 

KSD 

22.49 

16.86 

16.18 

8.59 

7.57 

7.38 


MM-Opt. 

MVE 

25.90 

23.3 

20.33 

6.97 

6.33 

5.92 


MM-Opt. 

KSD 

14.50 

7.90 

7.49 

3.57 

2.22 

2.38 


r-Bisq. 

MVE 

42.40 

40.6 

38.66 

20.22 

19.95 

19.02 


r-Bisq. 

KSD 

35.82 

34.95 

34.15 

19.59 

18.24 

17.57 


r-Opt. 

MVE 

26.75 

25.36 

21.39 

7.86 

7.06 

6.39 


r-Opt. 

KSD 

16.60 

12.62 

11.11 

4.17 

3.92 

3.60 


Rocke 

MVE 

5.89 

3.47 

2.62 

0.53 

0.28 

0.25 


Rocke 

KSD 

6.70 

3.17 

2.21 

0.52 

0.35 

0.24 


S-D 

Subs 

34.50 

32.90 

31.81 

13.23 

12.30 

13.43 


S-D 

KSD 

92.02 

17.09 

15.55 

18.31 

5.76 

5.33 


KSD 


24.12 

14.83 

13.87 

9.08 

6.04 

6.35 


MCD 


39.35 

32.04 

25.62 

15.00 

12.50 

8.73 


MVE 


56.38 

55.81 

41.65 

20.88 

18.15 

14.97 

0.5 

S-E 

MVE 

53.91 

52.01 

51.12 

15.26 

17.78 

17.49 


S-E 

KSD 

27.22 

23.98 

23.80 

9.9 

8.72 

8.18 


MM-Bisq. 

MVE 

65.10 

64.73 

63.46 

26.9 

25.70 

25.05 


MM-Bisq. 

KSD 

20.18 

19.50 

18.77 

8.82 

8.25 

7.84 


MM-Opt. 

MVE 

47.23 

46.63 

45.78 

14.67 

13.81 

13.35 


MM-Opt. 

KSD 

14.82 

11.78 

11.90 

3.72 

3.52 

3.69 


r-Bisq. 

MVE 

64.71 

63.58 

63.18 

27.61 

26.95 

26.71 


r-Bisq. 

KSD 

29.57 

28.72 

28.29 

14.83 

13.41 

12.42 


r-Opt. 

MVE 

47.06 

46.44 

45.78 

14.86 

13.82 

13.44 


r-Opt. 

KSD 

15.75 

12.76 

12.26 

4.77 

4.03 

3.95 


Rocke 

MVE 

36.97 

35.31 

34.21 

7.69 

7.41 

7.23 


Rocke 

KSD 

12.53 

9.35 

8.54 

2.38 

1.96 

1.86 


S-D 

Subs 

21.43 

19.15 

18.72 

6.68 

6.71 

6.50 


S-D 

KSD 

17.34 

13.67 

12.44 

4.54 

3.71 

3.49 


KSD 


19.94 

16.57 

10.77 

7.73 

7.58 

3.54 


MCD 


35.13 

27.90 

23.71 

13.99 

10.24 

7.70 


MVE 


65.70 

64.06 

50.84 

18,39 

16.73 

13.88 


Table 13: Simulation: Maximum mean D for scatter and location with p = 20 
and 20% contamination 
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Start 


Scatter 



Location 



n = 


150 

300 

600 

150 

300 

600 

0 

S-E 

MVE 

17.90 

17.03 

16.76 

2.64 

2.91 

3.04 


S-E 

KSD 

20.13 

18.39 

17.76 

2.63 

2.87 

3.03 


MM-Bisq. 

MVE 

14.05 

13.81 

13.59 

2.59 

3.24 

3.51 


MM-Bisq. 

KSD 

12.02 

11.80 

11.35 

2.64 

2.56 

2.38 


MM-Opt. 

MVE 

8.89 

7.44 

5.79 

0.87 

0.93 

0.84 


MM-Opt. 

KSD 

7.03 

5.34 

4.25 

0.73 

0.71 

0.65 


r-Bisq. 

MVE 

27.13 

28.5 

29.22 

5.81 

7.22 

8.04 


r-Bisq. 

KSD 

29.08 

28.86 

27.05 

7.79 

7.65 

7.19 


r-Opt. 

MVE 

10.81 

8.88 

8.34 

1.22 

1.24 

1.31 


r-Opt. 

KSD 

10.14 

8.56 

8.15 

1.15 

1.19 

1.28 


Rocke 

MVE 

5.95 

3.60 

2.72 

0.35 

0.23 

0.18 


Rocke 

KSD 

6.27 

3.03 

1.71 

0.33 

0.22 

0.13 


S-D 

Subs 

13.08 

10.12 

9.28 

1.65 

1.49 

1.42 


S-D 

KSD 

8.97 

4.64 

2.71 

1.01 

0.66 

0.48 


KSD 


15.98 

7.97 

3.43 

1.30 

1.01 

0.5 


MCD 


22.47 

17.66 

12.42 

3.41 

2.68 

2.17 


MVE 


15.9 

9.42 

5.36 

1.11 

0.89 

0.53 

0.5 

S-E 

MVE 

24.5 

20.59 

16.14 

3.37 

3.04 

2.57 


S-E 

KSD 

17.92 

15.72 

14.91 

2.20 

2.29 

2.36 


MM-Bisq. 

MVE 

34.03 

30.11 

21.75 

5.84 

5.82 

4.54 


MM-Bisq. 

KSD 

11.44 

10.19 

9.72 

1.78 

1.88 

1.91 


MM-Opt. 

MVE 

23.48 

19.8 

14.01 

2.81 

2.57 

1.95 


MM-Opt. 

KSD 

8.05 

5.30 

4.82 

0.89 

0.71 

0.71 


r-Bisq. 

MVE 

33.97 

30.14 

23.06 

6.01 

6.02 

5.02 


r-Bisq. 

KSD 

21.64 

21.32 

21.15 

4.73 

4.36 

3.89 


r-Opt. 

MVE 

24.15 

20.62 

14.84 

2.94 

2.75 

2.15 


r-Opt. 

KSD 

9.87 

8.22 

7.66 

1.27 

1.23 

1.20 


Rocke 

MVE 

18.49 

14.16 

9.42 

1.73 

1.36 

0.95 


Rocke 

KSD 

7.03 

3.63 

2.82 

0.57 

0.33 

0.30 


S-D 

Subs. 

11.02 

8.59 

7.17 

1.31 

1.11 

1.03 


S-D 

KSD 

8.18 

3.62 

1.87 

0.87 

0.4 7 

0.33 


KSD 


17.46 

5.96 

3.72 

1.64 

0.75 

0.55 


MCD 


19.46 

16.35 

12.34 

2.99 

2.47 

1.80 


MVE 


33.78 

22.49 

12.42 

2.74 

2.21 

1.19 


Table 14: Simulation: Maximum mean D for scatter and location with p = 30 
and 10% contamination 
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Start 


Scatter 



Location 



n = 


150 

300 

600 

150 

300 

600 

0 

S-E 

MVE 

62.17 

73.37 

76.03 

21.85 

28.44 

30.79 


S-E 

KSD 

54.71 

49.14 

47.32 

14.65 

15.51 

16.19 


MM-Bisq. 

MVE 

77.51 

110.69 

121.81 

31.98 

55.85 

67.37 


MM-Bisq. 

KSD 

41.01 

28.82 

27.49 

17.35 

15.02 

14.08 


MM-Opt. 

MVE 

46.42 

59.67 

65.08 

11.8 

16.91 

19.82 


MM-Opt. 

KSD 

21.08 

14.21 

14.54 

5.04 

3.79 

4.33 


r-Bisq. 

MVE 

94.54 

115.61 

120.78 

46.25 

64.34 

71.33 


r-Bisq. 

KSD 

62.96 

60.82 

60.14 

36.87 

33.54 

32.64 


r-Opt. 

MVE 

58.01 

68.3 

70.63 

15.62 

21.06 

23.21 


r-Opt. 

KSD 

28.27 

20.71 

20.46 

7.32 

6.97 

6.65 


Rocke 

MVE 

7.11 

6.86 

10.96 

0.46 

0.49 

1.42 


Rocke 

KSD 

14.52 

5.61 

3.37 

0.92 

0.35 

0.24 


S-D 

Subs. 

38.76 

37.38 

37.03 

17.28 

17.07 

18.04 


S-D 

KSD 

98.67 

29.66 

24.07 

13.45 

11.19 

10.13 


KSD 


64.17 

36.25 

30.29 

14.52 

10.31 

10.97 


MCD 


46.91 

34.02 

29.67 

16.31 

12.55 

10.68 


MVE 


177.47 

149.86 

137.23 

44.59 

50.36 

51.49 

0.5 

S-E 

MVE 

76.66 

78.43 

79.66 

24.39 

26.77 

28.17 


S-E 

KSD 

47.95 

42.22 

41.57 

13.65 

12.97 

12.73 


MM-Bisq. 

MVE 

103.92 

109.61 

111.5 

42.06 

50.14 

53.86 


MM-Bisq. 

KSD 

37.47 

32.15 

27.12 

14.03 

13.63 

12.31 


MM-Opt. 

MVE 

73.64 

75.91 

77.27 

19.38 

21.86 

23.36 


MM-Opt. 

KSD 

19.88 

18.77 

15.48 

6.77 

5.24 

4.85 


r-Bisq. 

MVE 

103.88 

109.08 

111.6 

44.1 

51.94 

56.28 


r-Bisq. 

KSD 

52.86 

50.44 

48.78 

24.05 

23.25 

22.98 


r-Opt. 

MVE 

75.98 

78.91 

80.83 

20.69 

23.80 

25.77 


r-Opt. 

KSD 

29.07 

25.77 

20.64 

7.39 

7.21 

6.79 


Rocke 

MVE 

50.49 

44.86 

41.68 

7.78 

6.99 

6.33 


Rocke 

KSD 

23.09 

14.86 

9.76 

3.12 

2.18 

1.43 


S-D 

Subs. 

25.23 

23.01 

22.12 

10.14 

9.08 

8.04 


S-D 

KSD 

22.31 

19.56 

17.10 

6.64 

6.23 

5.80 


KSD 


58.96 

40.57 

28.03 

15.26 

13.64 

10.25 


MCD 


42.37 

29.51 

22.62 

13.27 

10.58 

9.43 


MVE 


100.85 

97.21 

79.25 

16.85 

24.84 

21.56 


Table 15: Simulation: Maximum mean D for scatter and location with p = 30 
and 20% contamination 
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